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RNAi LIBRARY 



10 



FIELD OF THE INVENTION 

!£» 

The present invention relates to an RNAi library. More particularly the ^> 
invention relates to the construction of a gene expression library encoding RNA r~ 
capable of suppressing an expressed gene. CX3 

m 

BACKGROUND OF THE INVENTION O 

o 

The introduction of double stranded RNA (dsRNA) into a range of organisms ~< 
induces both a potent and specific gene silencing effect. This form of gene suppression 
by a dsRNA molecule was first observed in Caenorhabditis elegans and given the term 
RNA interference or RNAi (Fire et al 1998). In an attempt to optimise the use of 

15 antisense RNA as a tool for controlling specific gene expression in worms, Fire et ai 
(1998) found that dsRNA was more effective than antisense RNA alone. The dsRNA 
could be generated in vitro (Fire et al 1998) or in vivo (Tavernarakis et al 2000) and still 
mediate gene suppression with high specificity. Subsequent studies have shown that 
dsRNA is an effective inducer of gene silencing in a wide range of eukaryotic 

20 ' organisms and that the mechanism behind this form of gene regulation is most likely 
conserved throughout evolution (Baulcombe, D. C. (1996) Plant Mol Biol 32(1-2), 79-88; 
Lohmanrv ]\ U., Endl, I., and Bosch, T. C. (1999) Dev Biol 2U(1), 211-4; Ngo, H., 
Tschudi, C, Gull, K., and Ullu, E. (1998) Proc Natl Acad Sd USA 95(25), 14687-92; 
Cogoni, C., and Macino, G. (1999) Nature 399(6732), 166 7 9; Kennerdell, J. R., and 

25 Carthew, R. W. (1998) Cell95(7), 1017-26; Schoppmeier, M., and Damen, W. G. (2001) 
Dev Genes Fvol 211(2), 76-82; Baker, M. W., and Macagno, E. R. (2000) CurrBiol' 
10(17), 1071-4; Wargelius, A., Ellingsen, S., and Fjose, A. (1999) Biochem BiophysRes 
Commun 263(1), 156-61). 

The molecular mechanism of RNAi has begun to be deciphered using 

30 biochemical and genetic approaches in different experimerttal systems (Hammond, 
S.M., Caudy, A.A., and Hannon, GJ. (2001) Nat Rev. Genet 2, 110-19). Presently, 
RNAi is postulated to involve both an initiation step and an effector step. During the 
initiation phase, dsRNA is processed by the RNaseELI family nuclease Dicer to produce 
21-23 nucleotide duplex siRNAs (small interfering RNAs). These short stretches of 

35 dsRNA carry 2 nucleotide 3'-OH overhangs that contribute to the efficacy of gene 
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silencing (Elbashir, S.M., Lendeckel, W., and Tuschl, T. (2001) Genes & £tevl5:188- 
200). In the effector phase, these siRNAs are incorporated into a multiprotein complex 
called RISC (RNA-induced silencing complex) that targets transcripts by base pairing 
between one of the siRNA strands and the endogenous mRNA (Hammond, S.M., 

5 Bernstein, E., Beach, D., and Hannon, GJ. (2000) Nature 404: 293-96). A nuclease 

activity associated with the RISC complex then cleaves the mRNA-siRNA duplex thus 
targeting the cognate mRNA for destruction. 

In mammalian cells the use of dsRNA to control gene expression has been 
hampered by the presence of a unique global response mechanism. Mammalian cells 

10 exposed to dsRNA longer than 30 base pairs in length trigger a response mechanism 
involving activation of two key enzymes, dsRNA-activated protein kinase (PKR) and 
2'5' oligoadenylate polymerase/ RnaseL (Kumar, M., and Carmichael, G. G. (1998) 
Microbiol Mol Biol Rev 62(4), 1415-34). The activation of these enzymes leads to a 
cessation of protein synthesis and eventually cell death via apoptosis. It was thus 

15 anticipated that the introduction of long dsRNA would activate this global response 
system. However, studies have shown that in both mouse pre-implantation embryos 
(Svoboda, P., Stein, P., Hayashi, H., and Schultz, R. M. (2000) Development!!** '(19), 
4147-4156; Wianny, F., and Zkrnicka-Goetz, M. (2000) Nat Cell Biol 2(2), 70-5) and 
undifferentiated embryonic stem cells and embryonic carcinoma cells (Yang, S., 

20 Tutton, S., Pierce, E., and Yoon, K. (2001) Mol Cell Biol21(22) f 7807-16; Billy, E., 

Brondani, V., Zhang, H., Muller, U. and Filipowicz, W. (2001) Proa Natl Acad Sci 98, 
14428-14483; Paddison, P., Caudy, A. A., and Hannon, GJ. (2002) Proc. Natl Acad. Sci. 
99, 1443-1448), the use of in vitro generated long dsRNA was able to mediate specific 
gene silencing. The primary reason for these observations was that these cell systems 

25 lack the generalised responses to dsRNA. These results were encouraging but placed 
particular limitations on the utility of this approach in' differentiated mammalian cells. 

Following on from observations that the products of the Dicer enzyme could 
mediate RNAi in Drosophila embryo extracts, it was then demonstrated that 
chemically synthesised 21 bp siRNAs could be used in a wide range of human and 

30 mouse cell lines to induce gene silencing (Elbashir, S. M., Harborth, J., Lendeckel, W., 
Yalcin, A., Weber, K., and Tuschl, T. (2001) Nature 411(6836), 494-8; Caplen, N.J., 
Parrish, S., Imani, F., Fire, A., and Morgan, R.A. (2001) Proa Natl. Acad Sci. 98, 9742- 
9747). This approach for transiently controlling the expression of a wide range of 
different target genes has been demonstrated and is becoming the method of choice 

35 for determining gene function in mammalian cells (Hsu, J.Y., Reimann, J. D. R., 
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Sorensen, CS., Lucas, J., and Jackson, P. K (2002) Nature Cell Biol 4, 358-366; 
• Thompson, B v Tonwsley, F., Rosin- Arbesfeld, R., Muisi, H., and Bienz, M. (2002) 
Nature Cell Biol 4> 367-373). One of the limitations associated with these synthetic 
dsRNA strategies is the transient nature of the suppressive effect induced by the • 
5 dsRNA. 

More recently, it has been shown that mammalian cells contain a very large 
group of small RNAs called microRNAs which are postulated to be transcribed as 
hairpin RNA precursors that are processed by Dicer to produce the mature 21 base 
forms (Lagos-Quintana, M., Rauhut, R v Lendeckel, W., and Tuschl, T. (2001) Science 

10 294, 853-858; Lau, N.C, lim, L.P., Weinstein, E.G., and Bartel, D.P. (2001) Science 294, 
858-862; Lee, R.C. and Ambros, V. (2001) Science29£, 862-864). Several groups have 
exploited this naturally occurring biological mechanism to show that short hairpin 
RNAs (shRNAs) can induce specific gene silencing in mammalian cells (Paddison, P.J., 
Caudy, A A., Bernstein, E., Hannon, G.J., and Conklin, D.S. (2002) Genes &Devl6, 

15 948-958; Brummelkamp, T.R., Bernards, R., and Agami, R. (2002) Science 296, 550-553; 
Sui, G., Soohoo, C, Affar, E., Gay, F., Shi, Y., Forrester, W. C, and Shi, Y. (2002) Proc 
Natl Acad Sci 99, 5515-20; Yu, J., DeRuiter, SX., and Turner, D.L. (2002) Proc Natl 
AcadSci99, 6047-52). Furthermore, expression cassettes have been developed using 
the endogenous U6 snRNA or HI promoters to drive expression of sequence-specific 

20 shRNAs that can regulate gene expression both transiently and stably in mammalian 
cells via RNAi (Paddison, P.J., Caudy, A.A., Bernstein, E., Hannon, G.J., and Conklin, 
D.S. (2002) Genes &Devl6, 948-958; Brummelkamp, T.R., Bernards, R., and Agami, R. 
(2002) Science 296:550-553). ShRNAs produced from these expression cassettes were 
processed by Dicer to 21 bp siRNAs which are believed to be the effectors of gene 

25 silencing. It is anticipated that these cassettes will be useful for reverse genetic 
approaches in mammalian cells and transgenic mice to better understand gene 
function, and also as therapeutics. 

A major limitation with the state of the art for RNAi in mammalian cells is the 
lack of any strategy for using RNAi knockdowns in a forward genetic approach to 

30 identify new genes involved in cellular processes or different human diseases. At 
present, synthetic siRNAs or RNAi expression constructs are designed on a gene-by- 
gene basis limiting the utility of these strategies for both generating and screening 
genome-wide RNAi expression libraries. The present invention provides methods 
which enable the production of RNAi libraries. 

35 
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SUMMARY OF THE INVENTION IB 

©> 

=H 

In a first aspect the present invention provides a method of produdng a DNA 3> 
molecule wherein mRNA transcribed from the DNA molecule forms hairpin RNA ^ 
5 (hRNA), the method comprising: ®T 
synthesizing a first DNA strand comprising in order a first sequence, a random 35 
sequence and a second sequence, wherein the second sequence has a 3' end and a 5' fsj 
end such that nucleotides at the 3' end of the second sequence are complementary to (O 
nucleotides at the 5' end of the second sequence such that the second sequence forms a ^ 
10 stem loop; «^ 
synthesizing a complementary DNA strand extending from the stem loop using 
a DNA polymerase, the complementary DNA strand being complementary to the first 
region and the random sequence so as to form hairpin DNA; 

denaturing the hairpin DNA to form a single DNA strand; 
15 adding a primer having the same sequence as the first region and DNA 

polymerase to synthesize double stranded DNA. 

In a second aspect the present invention provides a method of preparing an 
expression vector, expression of which produces dsRNA, the method comprising: 

synthesizing a first DNA strand comprising in order at least four consecutive 
20 adenosine nucleotides, a random sequence, at least four consecutive thymidine 
nucleotides and a primer binding site; 

annealing a primer to the primer binding site and synthesizing a second DNA 
strand which is substantially complementary to the first DNA strand and forms 
double stranded DNA; and 
25 cloning the double stranded DNA into an expression vector between two 

convergent promoters. 

In a third aspect the present invention provides a method for determining a 
function of a gene, the method comprising: 

synthesizing a first DNA strand comprising in order a first sequence, a 
30 random sequence and a second sequence, wherein the second sequence has a 3* end 
and a 5* end such that nucleotides at the 3' end of the second sequence are 
complementary to nucleotides at the 5* end of the second sequence such that the 
second sequence forms a stem loop; 
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synthesizing a complementary DNA strand extending from the stem loop using 
a DNA polymerase, the complementary DNA strand being complementary to the first 
region and the random sequence so as to form hairpin DNA; 

denaturing the hairpin DNA to form a single DNA strand; 
5 adding a primer having the same sequence as the first region and DNA 

polymerase to synthesize double stranded DNA; 

cloning the double stranded DNA into an expression vector wherein the double 
stranded DNA is under the control of a promoter; 

transfecting an effective amount of the expression vector into a cell under 
10 conditions permitting transcription of the double stranded DNA to produce a 
transfected cell; 

detecting one or more changes selected from morphological changes and 
behavioural changes in the transfected cell relative to a control cell. 

In a fourth aspect the present invention provides a method for determining a 
15 function of a gene, the method comprising: 

synthesizing a first DNA strand comprising in order at least four consecutive 
adenosine nucleotides, a random sequence, at least four consecutive thymidine 
nucleotides and a primer binding site; 

annealing a primer to tine primer binding site and synthesizing a second DNA 
20 strand which is substantially complementary to the first DNA strand and forms 
double stranded DNA; and 

cloning the double stranded DNA into an expression vector between two 
convergent promoters; 

transfecting an effective amount of the expression vector into a cell under 
25 conditions favouring transcription of the double stranded DNA to produce a 
transfected cell; 

detecting one or more changes selected from morphological changes and 
behavioural changes in the transfected cell relative to a control cell. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Fig* 1 Generation of a random shRNA library. 

Fig. 2 Generation of a random siRNA library. 

5 Fig. 3 Method for constructing genome-specific shRNA and siRNA libraries. 

Fig. 4 Identification of HTV-specific shRNA or siRNA using genetic selections. 

Fig.5 p53 shRNA construction scheme 

DETAILED DESCRIPTION 

10 

The present invention relates to methods which enable production of a library 
of shRNA or siRNA genes encoding shRNA or siRNAs that are capable of recognising 
all target mRNA sites to identify, isolate and characterise unknown and known genes 
that contribute to a specific cellular phenotype or are modified by specific stimuli. 

15 These expression libraries are designed to suppress the expression of a target gene and 
based on the sequence of the encoded shRNA or siRNA identify the target gene 
responsible for the change in cellular phenotype. This method requires the 
construction of random shRNA and siRNA expression libraries that contain inserts 
encoding RNA sequences that form double-stranded RNA via intramolecular or 

20 intermolecular hybridisation in vivo, respectively! 

The use of dsRNA as a mediator has distinct advantages over hammerhead and 
hairpin ribozymes including the presence of a natural cellular protein complex 
(termed RISC) that binds the expressed dsRNA and mediates interaction with the 
target mRNA and cleavage of the target mRNA. 

25 In a first aspect the present invention provides a method of producing a DNA 

molecule wherein mRNA transcribed from the DNA molecule forms hairpin RNA 
(hRNA), the method comprising: 

synthesizing a first DNA strand comprising in order a first sequence, a random 
sequence and a second sequence, wherein the second sequence has a 3' end and a 5' 

30 end such that nucleotides at the 3' end of the second sequence are complementary to 
nucleotides at the 5* end of the second sequence such that the second sequence forms a 
stem loop; 

synthesizing a complementary DNA strand extending from the stem loop using 
a DNA polymerase, the complementary DNA strand being complementary to the first 
35 region and the random sequence so as to form hairpin DNA; 
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denaturing the hairpin DNA to form a single DN A strand; JTI 

adding a primer having the same sequence as the first region and DNA & ^ 

polymerase to synthesize double stranded DNA. ^ 

In a preferred embodiment a deoxyuracil nucleotide is included in the first ^ 
5 sequence and prior to addition of the primer the single DNA strand is depurinated j£ 
with uracil nucleotide glycosylase and P-elimated. CE 

In a preferred embodiment the double stranded DNA is cloned into an 
expression vector. More preferably the double stranded DNA is cloned into an ^ 
expression vector wherein the double stranded DNA is under the control of a © 
10 promoter. 

In a preferred embodiment the first DNA strand comprises a restriction 
enzyme site. 

Delivery and transcription of the expression vectors of the present invention in 
a host cell provides a shRNA specific for a target mRNA having complementarity 
15 with the double-stranded RNA region. The shRNAs of the invention have been 
shown to be effective modifiers of gene expression. 

Preferably the random sequence is about 19 to about 30 base pairs in length. 
More preferably the random sequence is from 19 to 25 base pairs in length. Most 
preferably the random sequence is 19 base pairs in length. 
20 The random sequence may be synthetically generated by random insertion of 

nucleotides during synthesis or by randomly digesting a viral or pathogenic genome 
to fragment the starting DNA. Random digestion of a viral or pathogenic genome 
may be achieved by techniques known to those of skill in the art, such as DNAse I 
digestion. Synthetic sequences may be generated chemically according to known 
25 methods such as the solid phase phosphoramidite triester method described by 
Beaucage and Caruthers (1981) Tetrahedron Letts. 22(20:1859-1862, e.g. using an 
automated synthesiser as described in Needham-VanDevanter et al (1984) Nucleic 
Acids Res., 12:6159-6168. Purification of the molecule, where necessary, is typically 
performed by either gel electrophoresis or by anion-exchange HPLC as described in 
30 Pearson and Regnier (1983) J. Chrom. 255:137-149. The sequence can be verified using 
the chemical degradation method of Maxam and Gilbert (1980) in Grossman and 
Moldave (eds.) Academic Press, New York, Methods in Enzymology 65:499-560. 

As used herein, the term "complementary" is used in reference to 
"polynucleotides" and oligonucleotides" (which are interchangeable terms that refer to 
35 a sequence of nucleotides) related by the base pairing rules. For example, the 
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sequence 5-CTGAG-3' is complementary to the sequence 5 - CTCAG-3'. 
Complementarity can be partial or total. Partial complementarity is where one or 
more nucleic acid bases is not matched according to the base pairing rules. Total or 
complete complementarity is where each and every nucleic acid base is matched with 
5 another base according to base pairing rules. The degree of complementarity between 
nucleic acid strands has significant effects on the efficiency and strength of 
hybridisation between nucleic acid strands. 

The term "loop" refers to an unpaired secondary structure in an RNA sequence 
in which a single-stranded RNA sequence is flanked by RNA sequences which are 
10 capable of pairing with each other to form a "stem" structure. The term "unpaired" 

when made in reference to RNA refers to a secondary structure in an RNA sequence in 
which RNA is single-stranded and is flanked by RNA sequences which are incapable 
of pairing with each other, but which are capable of pairing with other sequences. 
Loop structures of any length and any sequence are contemplated to be within the 
15 scope of this invention. Computer programs for the prediction of RNA secondary 
structure formation are known in the art and include, for example, "RNAFOLD" 
described in Hofacker et al. (1994) Monatshefte R Chemie 125:167-188; McCaskill 
(1990) Biopolymers 29:1105-1119 and "DNASIS" (Hitachi). 

The term "expression vector" as used herein refers to a recombinant DNA 
20 molecule containing a desired coding sequence and appropriate nucleic acid 

sequences necessary for the expression of the operably linked coding sequence in a 
host organism. Nucleic acid sequences necessary for expression in eukaryotic cells 
usually include a promoter and termination and polyadenylation signals. In a 
preferred embodiment the expression vector also incorporates stabilisation elements 
25 into the expressed RNA to increase the stability of the RNA. As used herein, the term 
"vector" is used in reference to nucleic acid molecules that transfer DNA segments) 
from one cell to another. Vector includes plasmids, viruses, retrotransposons and 
cosmids." 

Preferably the double stranded DNA is cloned into an expression vector 
30 suitable for expression in a mammalian cell. Methods which are well known to those 
skilled in the art can be used to construct expression vectors containing a sequence 
which encodes the RNA expression library. These methods include in vitro 
recombinant DNA techniques, synthetic techniques and in vivo recombination or 
genetic recombination. Such techniques are described in Sambrook et al (1989) 
35 Molecular Cloning, A laboratory Manual, Cold Spring Harbor Press, Plainview N.Y. 



* • • 



10 

141360837 



> 



so 
m 

and Asubel F Metal (1989) Current Protocols in Molecular Biology, John Wiley & 
Sons, New York N.Y. 

As used herein, the term "promoter" refers to a single promoter sequence as <^ 
well as to a plurality (i.e., one or more) of promoter sequences which are operably 
5 linked to each other and to at least one DNA sequence of interest. Promoters consist 

of short arrays of DNA sequences that interact specifically with cellular proteins |— 
involved in transcription (Maniatis T. et al., Science 236:1237 (1987). Promoter ^ 
elements have been isolated from a variety of eukaryotic sources including genes in 
plant, yeast, insect and mammalian cells and viruses. The selection of a particular "Q 
10 promoter depends on what cell type is to be used to express the DNA sequence of 

interest. The promoter also optionally includes distal enhancer or repressor elements 
which can be located as much as. several thousand base pairs from the site of 
transcription. The promoter may be constitutive, such as a promoter active under 
most environmental conditions or stages of development or the promoter may be 
15 inducible, and respond to, for example, an extracellular stimulus. 

Efficient expression of recombinant DNA sequences in eukaryotic cells requires 
expression of signals directing the efficient termination and polyadenylation of the 
resulting transcript. Transcription termination signals are generally found 
downstream of the polyadenylation signal and are generally a few hundred 
20 nucleotides in length. 

In a preferred embodiment the double stranded DNA is cloned into an 
expression vector under the control of a U6 snRNA, HI or T7 promoter. More 
preferably the double stranded DNA is cloned in to an expression vector under the 
control of a U6 snRNA promoter. 
25 Synthesis of the second DNA strand may be achieved using second strand 

synthesis techniques well known to those of skill in the art for synthesizing a second 
strand of DNA from a first strand of DNA, for example utilizing a DNA polymerase 
such as AmpliTaq DNA polymerase (Perkin Elmer). Suitable techniques for second 
strand synthesis may be as set out in Sambrook et al (1989) Molecular Cloning, A 
30 laboratory Manual, Cold Spring Harbor Press, Plainview N.Y. and Asubel F M et al 
(1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York N.Y. 

In a second aspect the present invention provides a method of preparing an 
expression vector, expression of which produces dsRNA, the method comprising: 
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synthesizing a first DNA strand comprising in order at least four consecutive ®^ 
adenosine nucleotides, a random sequence, at least four consecutive thymidine ^ 
nucleotides and a primer binding site; g 
annealing a primer to the primer binding site and synthesizing a second DNA ^ 
5 strand which is substantially complementary to the first DNA strand and forms 
double stranded DNA; and 

cloning the double stranded DNA into an expression vector between two q 

fo 

convergent promoters. 

Transcription from the convergent promoters of two strands of the resident «^ 
10 inserts results in the production of two small complementary RNAs that are capable of 
hybridising to form an siRNA with two to four base overhangs at their 3' ends. 

The expression vector produced according to the methods of the invention are 
useful in identifying the function of a gene or sequence of interest in an organism. 
Preferably the random sequence is about 19 to about 30 base pairs in length. 
15 More preferably the random sequence is from 19 to 25 base pairs in length. Most 
preferably the random sequence is 19 base pairs in length. 

In a preferred embodiment the double stranded DNA is cloned in to an 
expression vector between two convergent U6 snRNA, HI or T7 promoters. More 
preferably the double stranded DNA is cloned in to an expression vector between two 
20 convergent U6 snRNA promoters. 

In a preferred embodiment, the expression vectors prepared according to the 
methods of the first or second aspect are used to transfect a host cell. 

In a third aspect the present invention provides a method for determining a 
function of a gene, the method comprising: 
25 synthesizing a first DNA strand comprising in order a first sequence, a 

random sequence and a second sequence, wherein the second sequence has a 3' end 
and a 5 f end such that nucleotides at the 3' end of the second sequence are 
complementary to nucleotides at the 5' end of the second sequence such that the 
second sequence forms a stem loop; 
30 synthesizing a complementary DNA strand extending from the stem loop using . 

a DNA polymerase, the complementary DNA strand being complementary to the first 
region and the random sequence so as to form hairpin DNA; 

denaturing the hairpin DNA to form a single DNA strand; 
adding a primer having the same sequence as the first region and DNA 
35 polymerase to synthesize double stranded DNA; 
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cloning the double stranded DNA into an expression vector wherein the double ^ 
stranded DNA is under the control of a promoter; ^ 

transfecting an effective amount of the expression vector into a cell under ^ 
conditions permitting transcription of the double stranded DNA to produce a ^ 
5 transfected cell; 

detecting one or more changes selected from morphological changes and |^ 
behavioural changes in the transfected cell relative to a control cell. ^ 

In a fourth aspect the present invention provides a method for determining a © 
function of a gene, the method comprising: «^ 
10 synthesizing a first DNA strand comprising in order at least four consecutive 

adenosine nucleotides, a random sequence, at least four consecutive thymidine 
nucleotides and a primer binding site; 

annealing a primer to the primer binding site and synthesizing a second DNA 
strand which is substantially complementary to the first DNA strand and forms 
15 double stranded DNA; and 

cloning the double stranded DNA into an expression vector between two 
convergent promoters; 

transfecting an effective amount of the expression vector into a cell under 
conditions favouring transcription of the double stranded DNA to produce a 
20 transfected cell; 

detecting one or more changes selected from morphological changes and 
behavioural changes in the transfected cell relative to a control cell. 

The present invention provides methods for the identification of one or more 
functions of a nucleotide sequence in an organism. The methods of the invention 
25 selectively reduce, diminish or destroy the RNA encoded by the targeted coding 
sequence in order to render the RNA non-functional while the targeted gene in the 
host remains intact. These methods therefore employ a "knockdown" strategy to 
determine gene function instead of the traditional "knockout" methods. The invention 
is useful for the rapid identification of, for example, disease related genes which may 
30 be targeted for the treatment or prevention of disease. The methods of the present 
invention also have utility in identifying viral or pathogen-derived genes that play a 
major role in the susceptibility of cells to infection by viruses or pathogens. 

In a preferred embodiment the expression vector is a retroviral expression 

vector. 
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In a preferred embodiment the transfected cell is recovered and the double W 
stranded DNA insert recovered or amplified by, for example, using the polymerase ^ 
chain reaction, re-cloned and subjected to additional enrichment steps. ^ 

In a further preferred embodiment the enriched insert is sequenced and used to jp? 
5 identify potential target genes by, for example, homology searching, or utilised to 

capture the target mRNA. HI 

In a preferred embodiment the expression vector encodes a selectable marker, 
for example an antibiotic resistance gene, for selection of cells transfected with the 
expression vector. More preferably the expression vector encodes the G418 selection ^ 
10 marker. 

The term "transfection" as used herein refers to the introduction of a transgene, 
for example a vector, into a cell. Transfection may be accomplished by a variety of 
means known to the art including calcium phosphate-DNA co-precipitation, DEAE- 
dextran-mediated transfection, polybrene-mediated transfection, electroporation, 

15 microinjection, liposome fusion, lipofection, protoplast fusion, retroviral infection or 
biolistics (i.e., particle bombardment). Transfection may be transient or stable 
transfection. The term "stable transfection" or "stably transfected" refers to the 
introduction and integration of a transgene into the genome of a transfected cell. The 
term "transient transfection" or "transiently transfected" refers to the introduction of 

20 one or more transgenes into a transfected cell in the absence of integration of the 
transgene into the genome of the host cell. 

The term "gene of interest" refers to any gene , the manipulation of which may 
be deemed desirable for any reason, by one of ordinary skill in the art. 

In a preferred embodiment the methods of the present invention for 

25 determining the function of a genomic DNA sequence, a shRNA or siRNA sequence is 
introduced into a cell in order to reduce the amount of RNA expressed by that 
genomic sequence. 

It is desirable to express a sufficient amount of shRNA or siRNA such that 
substantially all the substrate RNA is cleaved. Such substantial abrogation of 

30 substrate RNA expression would facilitate the observation of the effect of depletion of 
gene function in the organism wherein the shRNA or siRNA is expressed. While 
desirable, complete elimination of the substrate RNA is not required by the methods 
of the invention. 
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A "control" cell as used herein includes a cell that is untransfected, has been ^Vj 
mock transf ected, or has been transf ected with an "empty vector" such as an ^ 
expression vector without the double stranded DNA insert. ^ 
Host cells, such as eukaryotic cells, harbouring the expression vectors described >5 
5 above also are provided by this invention. Suitable host cells include, but are not ^ 
limited to, bacterial cells, rat cells, mouse cells and human cells. 

The methods of the invention are useful for determining the function of a gene ^ 
or DNA sequence of interest in an organism by forward genetic approaches including g 
observing the effects of reducing expression of the gene or DNA sequence in the 
10 organism or of a homologous gene or DNA sequence in another organism. For 
example, data presented herein demonstrates that the function of the p53 or EGFP 
gene in HCT116 colon cancer cells or HEK 293 embryonic kidney cells respectively 
may be determined by siRNA or shRNA mediated cleavage of transcripts. 

The types of genetic selections that can be used in a forward genetic approach 
15 with a genome-wide RNAi library includes overcoming cell growth arrest by, for 
example, bypassing p53-mediated growth arrest and apoptosis; identifying new 
targets involved in chemotherapeutic drug resistance such as overcoming 5-FU- 
induced growth arrest, apoptosis and senescence; blocking activated signaling 
pathways, for example, identifying novel positive and negative regulators of signaling 
20 pathways implicated in cancer, such as the TGFP and Wnt pathways; elucidating 
resistance to viral and pathogen infection including genetic screens for genes that 
confer resistance to HIV infection or that interfere with the productive or latent phases 
of the viral life cycle or genetic screens for genes that interfere with the lifecycle of an 
intracellular parasite such as plasmodium; synthetic lethality screens to identify gene 
25 products whose inactivation leads to cell death, particularly in tumor cells deficient for 
either the p53 or pl6/Rb tumor suppression pathways; identifying genes involved in 
metastasis, for example using in vivo assays; identifying optimal siRNAs against 
specific target(s); detecting genes regulating specific promoters; detecting cell cycle , 
regulatory genes, for example using soft agar assays (for anchorage dependent 
30 growth) and minimal medium (for growth factor-independent growth), both of which 
are widely used indicators of cellular transformation in cell culture; identifying 
unknown genes responsible for tumorigenesis such as using bromo-deoxyuridine, a 
nucleoside analog that is toxic to cells undergoing active division. 

The function of a gene of interest in a first organism may be determined by 
35 reducing expression of a gene in a second organism using the methods of the 
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invention where ethical considerations preclude experimentation on the first organism ^ 
of interest. Thus, while ethical considerations preclude depleting gene expression in ^ 
humans in order to determine the function of that gene in a human individual, the ^ 
ability to reduce expression of a gene sequence that is homologous to the human gene gp 

5 sequence in a model organism (e.g. mouse) permits an initial determination of the 

function of the gene. The function of the gene may then be further investigated in ^ 
other model organisms and/ or in clinical trials with human subjects. ^ 
Throughout this specification the word "comprise", or variations such as © 
"comprises" or "comprising", will be understood to imply the inclusion of a stated ^ 

10 element, integer or step, or group of elements, integers or steps, but not the exclusion 
of any other element integer or step, or group of elements, integers or steps. 

All publications mentioned in this specification are herein incorporated by 
reference- Any discussion of documents, acts, materials, devices, articles or the like 
which has been included in the present specification is solely for the purpose of 

15 providing a context for the present invention. It is not to be taken as an admission that 
any or all of these matters form part of the prior art base or were common general 
knowledge in the field relevant to the present invention as it existed in Australia 
before the priority date of each claim of this application. . 

In order that the nature of the present invention may be more clearly 

20 understood preferred forms thereof will now be described with reference to the 
following non-limiting Examples. 

EXAMPLES 

25 Example 1 Generation of random shRNA expression libraries. 

With the current state of the art, gene-expressed shRNAs are designed and 
constructed on a gene-by-gene basis (Paddison, P.J., Caudy, A. A., Bernstein, E., 
Hannon, G.J., and Conklin, D.S. (2002) Genes &Devl6, 948-958; Brummelkamp, T.R., 
Bernards, R., and Agami, R. (2002) Science296, 550-553; Sui, G., Soohoo, C, Affar, E., 

30 Gay, F., Shi, Y., Forrester, W. C, and Shi, Y. (2002) Proc Natl Acad Sd 99, 5515-20; Yu, 
J., DeRuiter, S.L., and Turner, D.L. (2002) Proc Natl Acad Sd 99, 6047-52). To adapt the 
shRNA strategy for utility against all expressed sequences in a mammalian cell, the 
applicants have designed a method for constructing a universal random shRNA 
expression library. The steps involved are outlined in Figure 1. Initially, a pool of 

35 single-stranded DNA is synthesised containing a primer-binding site, a restriction 
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enzyme site (RE1), a 19 base random sequence (N19) and a stem loop (Figure 1 A)- The ^ 
design of this stem loop region is such that the 3' end of the oligonucleotide bases jK* 
pairs with complementary bases 5' to the loop sequence. This results in a substrate 
that can undergo self-priming. Using this feature of the oligonucleotide pool, DNA ^ 

5 polymerase is added to synthesise the complementary strands for both the random jfe 
sequence (C-19) and the primer-binding site (Figure IB). Following denaturation of ^ 
these structures, a primer is added that is the same sequence as the primer-binding site ^ 
and therefore hybridises with the 3' end of the denatured substrate (Figure 1C). "G| 
Addition of DNA polymerase results in second-strand synthesis to produce a fully 

10 double-stranded DNA population (Figure ID) that can then be digested with the 
restriction enzyme specific for the included restriction enzyme site. After digestion, 
this population of DNA inserts is cloned under control of the mammalian U6 snRNA 
promoter to produce a genome-wide shRNA expression library (Figure IE). Upon 
delivery and transcription of these vectors in mammalian cells, a shRNA is produced 

15 specific for a target mRNA having complementarity with the double-stranded RNA 
region (Figure IF). These shRNAs have been shown to be effective inhibitors of gene 
expression in mammalian cells (Paddison, P.J., Caudy, A.A., Bernstein, E., Hannon, 
G.J., and Conklin, DS. (2002) Genes &Devl6, 948-958; Brummelkamp, T.R., Bernards, 
R., and Agami, R. (2002) Sdence296, 550-553; Sui, G„ Soohoo, C, Affar, E., Gay, F., 

20 Shi, Y., Forrester, W. C, and Shi, Y. (2002) Proc Natl Acad Sd 99, 5515-20; Yu, J., 
DeRuiter, S.L., and Turner, D.L. (2002) Proc Natl Acad Sd 99, 6047-52). 

An alternative strategy to construct the DNA insert encoding the shRNA 
involves inclusion of a single deoxyuracil nucleotide between the primer binding site 
and the 19 base random sequence in the pool of single-stranded DNA initially 

25 synthesised. Prior to second strand synthesis this site is depurinated using uracil 

nucleotide glycosylase (UNG) and subsequently P-eliminated by boiling in piperidine. 
Through this process any competing component of the hairpin is removed enabling 
the primer to have free access to its primer binding site during second strand 
synthesis. Primer extension results in the production of a double stranded cassette 

30 that contains a blunt end at the depurination site and a Sail recognition sequence at the 
other end. As this arrangement results in the bottom strand becoming the coding 
strand the hairpin elbow sequence is inverted with respect to the scheme described ■ 
above. This second strategy is shown Figure 5. 
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Example 2 p53-specific shRNA regulates p53 expression and modifies a cellular gj| 
phenotype. 

To demonstrate that the method described in Example 1 could be used to ^ 
generate a shRNA insert encoding a shRNA capable of suppressing a specific target 
5 gene in mammalian cells, we constructed p53-specific and EGFP-specific shRNA. ^ 
expression plasmids. To produce the p53 shRNA insert, the following hairpin |$ 
oligonucleotide is synthesised (Sigma Genosys): Jfl 
R'-nrnA TTC CGTCGA c cig actcc agtggttaatctacgtcg agttc AAG AG act q 
CGAC-3'. The sequences in italics represent the primer-binding site with the Sail site O 
10 indicated by underlining. The first C following the Sail site represents the first «^ 
transcribed base while the bold text indicates the sequence of the 19 nucleotides 
specific for the target site in p53 (Brummelkamp, T.R., Bernards, R., and Agami, R. 
(2002) Science296, 550-553.). The stem loop structure of the hairpin oligonucleotide is 
composed of six complementary bases flanking the underlined loop sequence. The 
15 hairpin oligonucleotide synthesised for the EGFP target gene is 
5'-GTGA 7TC<2Z7(GG40 

CGAC-3'. The small hairpin oligonucleotide is resuspended in 10 mM Tris pH8.8 and 
the concentration adjusted to 100 mM. This solution is then heated briefly (3min) to 
75°C to denature any intermolecular hybrids and cooled to 37 °C for 30 min. At this 
20 temperature the partial hairpin structure of the oligonucleotide is permitted to form 
through intramolecular hybridisation of its complementary arms. The annealed 
hairpin is then stored at 4°C for at least 16 hours. Then 500 pmol of the partial hairpin 
is transferred to a solution of 67 mM Tris-HCl (pH 8.8),16.6mM (NH4)2SC>4* 6.7 mM 
MgCl2, 0.8 units of Ampli TaqVNA polymerase (Perkin Elmer) and 300 each of 
25 dGTP, dATP, dTTP, dCTP and incubated a 37 °C for 30 min. .In some reactions a trace 
of [oc- 32 P]dCTP is included to enable monitoring of the reaction by subsequent 
electrophoresis on a 16% denaturing polyacrylamide gel. After this interval, 5 nmol of 
the hairpin specific primer (5'-GTGATTCCGTCGACC-3') is added (in lXreaction 
buffer) and the solution incubated at 95 °C for 30 s, 50 °C for 10 min and 72 °C for 

30 another 10 min. The sample is then phenol extracted and ethanol precipitated in 0.3 M 
NaOAc and 2 volumes of ethanol before being redissolved and further purified by 
electrophoresed on a 2% agarose gel, band excision and recovery using the Gene Clean 
kit (BiolOl). The purified extension product is redissolved in 10 mM Tris (pH 7.5) and 
digested with 10U of Sail restriction endonuclease at 37 °C for 90 min then 

35 dephosphorylated by the addition of 10U of calf intestinal phosphatase. Following 
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second-strand synthesis, the double-stranded shRNA inserts are digested with Sail 
and ligated to pGUTLESS(U6+l) and pGUTLESS(U6+27), precut with Sail and Xbal " 
and dephosphorylated, tinder control of the human U6 promoter. The expression <^ 
vectors pGUTLESS(U6+l) and pGUTLESS(U6+27) are constructed by subdoning the 
5 U6+1 and U6+27 expression cassettes from pTZU6+l and pAVU6+ 27 (Paul, CP., 
- Good, P.D., Winer, I, and Engelke, D.R. (2002) Nature Biotech 20, 505-508), 

respectively, as BamHI-Hindm fragments into the pd4EGFP-Nl (Clontech) previously 
deleted for the CMV immediate early promoter, dEGFP ORF and the SV40 polyA q 

signal. ^ 
10 Each of the pGUTLESS(U6+l) and pGUTLESS(U6+27) derivative piasmids 

encoding the EGFP or p53-specific shRNAs can be tested in mammalian cells. For 
EGFP, the U6+1 and U6+27 expression vectors are co-transfected into HEK 293 cells 
with EGFP-N1 (target plasmid) and pSVP (transfection control). At 48 hours post- 
transfection, cells are assayed for level of EGFP expression using FACs analysis. 
15 Alternatively, the EGFP shRNA-encoding vectors can be delivered to a HEK 293 cell 
line stably expressing the dEGFP target gene. Following selection in G418, the pooled 
population or selected clones can be analysed for EGFP expression as above. To test 
the p53-specific shRNA expression piasmids, each is linearised and transfected into 
HCT116 colon cancer cells. Following selection in G418 for 14 days, pooled 
20 populations are exposed to 0 *iM and 200 |iM 5-fluorouracil (5-FU) for 48 hours at 
which point p53 protein levels are examined using Western analysis. 



Example 3 Screening random shRNA expression libraries. 

One advantage of screening a genome-wide random RNAi expression library is 

25 the ability to identify effective targets that are linked to the change in cellular 

phenotype using a genetic screen. To examine the selectivity and sensitivity of such a 
strategy, the p53-specific shRNA expression plasmid is diluted 1:10?, 1:10 s , 1:10 4 , 1:10 s 
and 1:10 6 with vector alone. Each of these plasmid mixes is delivered to HCT116 cells 
as above and Neo R cells selected in the presence of G418. Following selection, the 

30 population is then exposed to 200 |xM 5-FU for 48 hrs, and clones recovered in 

standard growth medium. In this genetic selection, p53 wild type cells are sensitive to 
the chemotherapeutic drug 5-FU and are unable to undergo clonal growth after 
exposure. In contrast, cells showing reduced p53 are resistant and retain the ability to 
form clones following selection. The 5-FU resistant clones from each plasmid mix are 

35 recovered and screened for the p53-specific shRNA expression plasmid using a rapid 
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PCR screen. Alternatively, the p53-specific shRNA expression plasmids can be 

linearised and transfected into HT29 colon cancer cells stably expressing a ^ 

temperature-sensitive p53 gene (McKay, B.C., Chen, F., Perumalswami, C. R., Zhang, ^ 
F., and Ljungman, M. (2000) MolBiol Celhl, 2543-51). At 38 °C, these cells do not 

express active p53 and thus grow normally. However, when shifted to 32 °C, these 0£ 

cells over-express functional p53 which causes growth arrest. Thus, cells expressing FTP 

an active p53-specific shRNA at 32 °C will overcome p53-induced growth arrest and O 

form colonies at this temperature. These clones can be recovered and analysed using ^ 



PCR as above. 



Example 4 Generation of random siRN A expression libraries. 

A second method of generating random RNAi expression libraries for genetic 
screening in mammalian cells is summarised in Figure 2. In this approach, a pool of 
single-stranded DNA is synthesised containing four adenosines, followed by a 19 base 

15 random sequence (N19), four thymidines, a restriction enzyme site (KE1), and a 
primer-binding site (Figure 2A). A primer is added to the pool of oligonucleotides 
and second-strand synthesis is directed using DNA polymerase to produce a pool of 
double-stranded DNAs (Figure 2B, C). This population is digested with the 
appropriate restriction enzyme and ligated between two convergent U6 promoters 

20 (Figure 2D), such that upon transcription of the two strands of the resident inserts, two 
small complementary RNAs are produced that are capable of hybridising to form a 
classical siRNA with two to four base overhangs at their 3' ends (Figure 2E). These 
f onns of siRNAs have been shown to mediate specific gene silencing in mammalian 
cells (Miyagishi, M. and Taira, K (2002) Nature Biotech 19, 497-500; Lee, N., Dohjima, 

25 T., Bauer, G., Li, H., Li, M., Ehsani, A., Salvaterra, P., and Rossi, J. (2002) Nature 
tfjofecfi 19, 500-505). 



Example 5 p53-specific siRNA regulates p53 expression and modifies a cellular 
phenotype 

30 One of the key features of the method described for constructing a random 

siRNA expression library is the requirement for an expression vector containing 
convergent U6 promoters (or alternatively, HI or T7 promoters) controlling expression 
of the sense and antisense RNAs encoded by the resident inserts. Toward this end, the 
U6 promoter can be PCR-amplified from pGUTLESS(U6+l) using the following PCR 

35 primers: 5'-GCGCAAGCITATAGGGAAITCGAGCrCGGTA-3' and 



o 
c 



20 39 

141360837 ff| 

if) 

S'-GCGCTCTAGAGGTGTTTCGTCCTTTCCACAA-S' . The PGR products are H 
digested with Hindm (near the 5' end) and Xbal (near the 3' end) and cloned into ^ 
pGUTLESS(U6+l), precut with Xbal and Hindffl, to replace the Pol HI stem terminator 3^ 
region (Paul, C JP., Good, P.D., Winer, I, and Engelke, D.R. (2002) Nature Biotech 20, ^ 

5 505-508). The end result is the plasmid pGUTLESS(2xU6) containing the convergent p§ 
U6 promoter expression cassette. ^ 

To demonstrate the capacity of the convergent U6 promoter expression cassette 
to produce both sense and antisense transcripts, and mediate gene silencing in 
mammalian cells, EGFP- and p53-specific siRNA expression vectors are constructed. 

10 In the case of the EGFP target, the following oligonucleotides are synthesised and 
annealed: 5'- TCGACAAAAACGGC AAGCTG ACCCTG AAG 1 1 i l T-3' and 
5 , -CTAGAAAAACTTCAGGGTCAGCTTGCCGTTTTTG-3 , . For the p53-specific 
insert, the following oligonucleotides are synthesised and annealed: 
5'-TCGACAAAAAGACTCCAGTGGTTAATCTAC and 5'- 

15 CT AG AAA AATAG ATTAACCACTGG AGTCTTTTTG-3' . Each of these inserts is 
cloned into pGUTLESS(2xU6), precut with Sail and Xbal, between the convergent U6 
promoters to produce the siRNA expression vectors. Each of these expression 
plasmids is tested as described above for the EGFP- and p53-specific shRNA plasmids. 

20 Example 6 Construction of target gene and genome (viral, pathogen)-specific 
shRNA and siRNA libraries 

The strategies described above allow the production of RNAi expression 
libraries that contain dsRNA genetic inhibitors for each of the expressed genes of any 
genome including mammalian cells. These same libraries also have utility for 

25 identifying both host genes and viral or pathogen-derived genes that play a major role 
in the susceptibility of cells to infection by viruses and pathogens. The described 
methods can be modified to construct RNAi expression libraries restricted to a specific 
viral or pathogen genome or to a limited number of targets genes. The latter 
application is particularly relevant for probing gene function of up- or down-regulated 

30 genes identified in large-scale microarray or subtractive hybridisation experiments 
where only a subset of genes is implicated in the phenotype under investigation. 
Figure 3 summarises the strategy for constructing target gene(s) and genome-specific 
shRNA and siRNA expression libraries. In the initial step, the target gene(s) or viral or 
pathogenic genome is treated with DNAsel to fragment the starting DNA into 19-29 

35 bp fragments (Figure 3A). To construct a shRNA expression library, the pool of DNA 
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fragments is ligated to a universal hairpin sequence and all DNA fragments containing 
a single hairpin linker are isolated (Figure 3B). A dsDNA adaptor (containing a 
primer-binding site) is then ligated to the end of these DNAs (that does not contain the 
hairpin linker) and all fragments having a single hairpin linker and dsDNA adaptor 

5 are isolated (Figure 3C). This pool of DNA is then denatured, annealed to the 

universal primer, subjected to second-strand synthesis and then digested and ligated 
under control of the U6 promoter in a mammalian expression plasmid (Figure 3D-F). 
To construct a siRNA expression library, the randomly fragmented 19-29 bp DNAs are 
ligated to a dsDNA adaptor (as used in Figure 3C) and all DNAs containing a single 

10 set of adaptors are isolated (Figure 3G). These DNAs can either be PCR-amplified 
using a primer specific for the ligated adaptors (Figure 3H) or digested directly and 
ligated between convergent U6 promoters (Figure 31). 

Example 7 Identification of HTV therapeutics using HIV-derived shRNA libraries. 

15 Genetic selection assays can be used to screen a HIV-specific RNAi expression 

library for effective RNAi construct that confer resistance to HIV infection or that 
interfere with the productive or latent phases of the viral life cycle. Such genetic 
selection assays using genetic suppressor element libraries have been described 
(Dunn, S.J., Park, S.W., Sharma, V., Raghu, G., Simone, J.M., Tavassoli, R., Young, 

20 L.M., Ortega, M.A., Pan, C-H., Alegre, G.J., Roninson, LB., Lipkina, G., Dayn, A, and 
Holzmayer, T.A. (1999) Gene Therapy 6, 130-137) and are outlined in Figure 4. In one 
assay, chronically infected promyelocyctic HL60 cells, which are 99% CD4 positive 
until induction of latent HIV, can be induced to lose CD4 upon the addition of 
TNFcc (type 4). Expression of an effective HIV-specific shRNA will be expected to 

25 interfere with this induction and result in the retention of CD4 on the cell surface. 
Cells containing effective shRNA constructs can then be separated from the CD4- 
negative population using FACs sorting. These constructs should be effective at 
inhibiting HTV induction in latently infected cells. In a second assay, CEM T4 cells 
infected with replicating HIV display an accumulation of p24 and a reduction of CD4 

30 (Figure 4B). Thus, expression of an effective shRNA construct that interferes with 
• productive infection can be identified by enriching for cells exhibiting the CD4- 
positive and p24rnegative phenotype using FACs, Both of these genetic selection 
systems can identify novel HIV-specific shRNA expressing vectors that could be used 
, as gene therapy against multiple stages of the HIV life cycle. 
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It will be appreciated by persons skilled in the art that numerous variations 
and/ or modifications may be made to the invention as shown in the specific 
embodiments without departing from the spirit or scope of the invention as broadly 
5 described. The present embodiments are, therefore, to be considered in all respects as 
illustrative and not restrictive. 

Dated this 4 th day of September 2002 

Johnson & Johnson Research Pty Ltd 
Patent Attorneys for the Applicant: 

BLAKE DAWSON WALDRON 
PATENT SERVICES 
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Figure 1 . Generation of a random shRNA library 
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Figure 2. Generation of a random siRNA library 
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Figure 3. Method for constructing genome-specific shRNA and siRNA libraries 
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Figure 4. Identification of HIV-specific shRNA or siRNAs using genetic 

selections j£j 
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Transfect CEM T4 cells 

I 

Infect with HIV-1 NIB and sort for CD4 positive and p24 negative cells 
Purify genomic DNA, PCR amplify shRNA inserts and sequence 

Re-test independent shRNA constructs 



5/7 



35 

m 

»4 



1 

i 

S3 

f 

CM 



a, 

i 

o 

■s 
g 



•I 

'co 

t 

cn 



o 

i 

1 
i 

U 
U 



8 



1 



O 

Oh 

I 

D 

| 

S3 



CO 

S 



^ Eh 



as 



to 



u 

I 

O 

8 



i 



(DC 

r 



c 

«< 



6/7 



1 

CO 



CO 



O 

I 

•5 



I 



eg u 

< Eh 



■& 
1 
J 

Oh 

oj 
Q 
«3 

-a 

cn 

CO 

Q> 
60 



I 

O 

u 

I 

8 

EH 

U 
Eh 

a 

u 
a 

is 

a 

s 

a 



s 

CD 

3 

1 



a 



O 
Eh 



I 

> 

U 



ST 



-U 

tn 

tn 
tn 

8 

tn 
u 
-P 



•8 

8 
8 

Eh 
U 



O 

H 
8 

u 
o 

Eh 



a 



u 

cd 
U 

u 



a 

s 

8 

Eh 

1 

S3 

d 

Eh 

a 
S3 



I 

s 

u 

I 

8 



a 

s 

cd 
tn 
a 




CJ 

Eh 
Eh 
U 
Eh 

a 

i 

o 
a 

e 
8 



O 

I 
8 

Eh 



4J 
U 



o 

8 

a 

S3 



s 

I 

5 

1 

Eh 
O 

S3 



u 
•p 

& 
& 

a 



I 



0— 

ITT 

a 
c 

T3 



• - 



7/7 



50) 

m 

w 

1 

IT? 

Q 
o 



o 
a. 

I 

•B 
S 

'a 

1 

<u 

•a 

2 
o, 

9 

c 
w 



u 

I 

g 



u 
tn 

a 



I 

rH 
+ 

CD 

•5 



8 

T3 
O 

a, 
a, 

'8 



I 

o 



i 
I 
I 

CD 

s 



s 



